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FIFI DQFTHE INVEMTION 



s The present invention relates to fusion proteins and a process for preparing fusion proteins. Th* jnvenlio^ also 

pertains to various oligonucleotide and amino add sequences which make up proteins of the present imwrtlon. 

aAnifftRm IMP OF THE INVENmON 

to Proteins, which In addition to the desired protein, also have an undesirable constituent or ^^fj^j" 
the end product are referred toastuelonprotein8.Whenprotelns are prepared by genetic engineering, the 

Z!!^ti!^!^9f6tBin is utilized particularly if. in direct expression, the desired protein « decomposed relatwely 

raoWlv by host-endogenous proteases, causing reduced or entirely inadequate yields of the desired protein 

^ T^e nS^udeSthe balL constituent of the fusion proteh Is usualty setected 'n»|^«™[!"f^ '^'^^ 
I* fusion^protef fe obtained. This hsdubilfty not onV provkles the desired protection ^B^rjl^r^^ 

teases but also permits easy separation from the soluble cell components. It Is usually accepted that the PK^tonf 

S^^JiplSefusL^p'Zn^relativefy^^^ 

■me priaratlonoffuslon proteins withashott ballast constltuem has bee^ For example, ageneluston 

was prJSrShteh codes f<J a fusion protein from the first ten amho acids of p-galactosWase and somat^teBn. 
20 llS^^Trt^L^^tthlsshortLnoacidchardldn 

nnfiitien hv the host-endogenous proteases (US-A 4 366 246. Column 15, Paragraph 2). 

e?-aT^ (S^ 0 292 763. we Know of fusion proteins, the baltast constituent of which consete d a p- 

gala^uLe fragment with more than 250 amino acWs. These fusion proteins are Insoluble, but they can easily be 

« "^uTf-Crnl'^Iii'be^^ribedln^ 

aspZ^Ts^isateboriousprocedureandoltenresultslnfuslonp^^^^^ 

chSeristics Thus, a need exists for an efficient process for producing fusion ptoteivi with a ruimber of attraclwe 
tSSlSproSse restetance. proper IdWHg, and etrecllve cleavage o» the ballast Irom the desired proieh. 

so SIIMMAFIY OF T HE INVENTION 

The present invention relates to a process for the pnpaiation ol fusion proteins. P;;^'^^ 
im^ention coniHln a desired protein and a ballast constituent. The process ol the present ftwertlon ^^"^^'"a 
Z^^^Xf^ library (Lure) coding lor ballast constttuents. insertng the mixed oligonucleol.de (^branr) nto a 
» ;iS?^thattheollg«2cleolldelsfuncdonalVllnkedtoaregulato^ 

^ red protein.^ transfomihg host cells with the so«btained vector populatton. Transfomante are then se- 

amlnoLStewWch alk»^ an easy cleavaged 
'''ZS?n2;.?atoTe^^^^^ 

easily be sdublllzed. Fuston proteins of the present Inventtan thus futflll the requirements established lor protease 



'^'Smwre. ollgonucleotWe of the present biventnn may be designed so that the ballast constituent does not 
4S Interfere with fowmg of the desired protein. 

pai |:F nPSCRIPTION OF THg DRAWINQS 

Figure 1 and its continuatton in Figure la and Figure 1 b show the constructfon of plasmid populatton (genebank) 
60 pm^ from the known ptemid pHl54«5« via plasmW plNT40. Other constiuctlons hawe not been graphically pre- 
sented because they are readily apparent from the figures. 

Figure 2 is a map ol plasmW pUHIO containing the complete HMQ CoA reductase gone. 
Figures 3 and 3a show oonslnicttan ol pIM^ a plasmid containing the mW-proinsulin gene. 

55 DETAILED DESCRIPTION O f THE INVENTION 

The Invention relates to a process for the preparation of a luslon protein characterized In *f<«">^«*fl?""JJ'^ 
otida is constnjcted whksh codes for the baiiast constituent of the fusion pioteia The oligonudeotidB mixture » intro- 
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duced in a vector in such a manner that it is lunctlonally linkad to a regulatory region and to the structural gene for tt^ 
pr«S ippraprlate hoet cells are transformed with the ptosmW population obtained In thte manner, and the 
T^pSngaZ ol coded fueior protein are setecled. Advantageous emtxxliments o1 th.s nventon are 

5 ''"ilel.lSrcleotide advantageously codes at the 3--end an amino acid or a group 

or Dsnnlt ^ and prelerabty enzymatic cleavage ol the ballast constituent from the desired protela AcconJIno to 

TrSe sSreT^rttoular. an oligonucleotide is preferably constructed whfch codes for a ballast constrtuent that 
""T^iCrlsTngrorselrelt.^^ 

onlv fulfill IheTequirements established lor protease resistance, but are also produced at a high expression rate and 
Td^esZ meZ^ pSe^ l» Insoluble, ci, easily be rendered sduble. In the dissoh,ed »^ "^jf^'J^^^ 
LlS^tituentao^rdingtotheinvemionthenpemiitsaetericallyfavof^ 
that it can be property folded and easily separated from the ballast constituent 

If Src^irSoSuiin is fom«d r a prT>form, the ballast constituent can be constituted .n such a manner that As 
Cleavage can occur concomitantly with the iranstormatton of the pro^jroteln imo the mature P^^'"^" ^^I'Sme 
3or example, the ballast constltoent and the C chain can be removed simultaneous^, yte^^^ 
20 mature insulin Which can be transformed into insulin Without any Side reactions involving much loss. 

iJe ^irtSlSt constttuent accordhQ to the Invention is actually shorter than the .»ual sflnal sequen^ ^ 
proteins does not disturb the foWing of the desired pralein. It therefore need not be eliminated prior to the final 

orocessino step yieldlna the mature protein. ^ / ii.^„»,a„,4\ 

The cSgonucleolidB coding for the ballast conslitusnt preferably containB the DNA sequence (coding strand) 

(DCD), 



in which D stands for A. G or T and X is 4-1 2, preferably 4-8. 
30 in particular, the oligonucleotide is characterized by the DMA sequence (coding strand) 

ATC (DCD)y (NNN), 

as in which N In the NNN triplet stands tor Idemical or dtll^rent nucleotides, excluding stop "dons, z is 1-4 «wj /« J« 
e-lt^eferably 6-10. wherein y is at least 4. It has proved advantageous for the oligonucleotide to have the DNA 
sequence (coding strand) 
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66 



ATG (OCD)5.a (NNM) 
especially If it has the DNA sequence (coding strand) 

ATG GCW (DCD)^.^ CGW 

or, advantageously 

ATG GCA (DCD)4.7 CGW 

Iii^lSint!oIi2 5n A model sequences fulfill all of these requirements. Codon DCD^ 
serine, threonine and alanine and therefore lor a relatively hydrophlllc protein chain. Stop cod^s are exdi^^^ 
selection of the amino adds remains within manageable scope. The following is a particularly prelerable embodiment 
of the DNA sequence for the ballast constituent, eepecially if the desired protein is proinsuiin: 
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ATG GCW (DCD)y, ACG CGW 



ATG GCD (DCD)y, ACG CGT 
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Kj^^^^^t^^^ regulation sequence ends with CC. The next to last triplet codes tor thr«)nlne and. 
tcShe^r*^^^ cS^ ter^nn^^ rjresents the recognitton sequence for r-^«^ <^ 
auentlv this oligonucleotide can be easily and unambiguously incoiporaled n gene constructwns. 

^?;TNNNU^roup codes in the ? position for an amino add or a gmup of amino acds mat pemirts sirnple. and 
nr-fl^hlTS^te sS^ion o1 the 

t'SflSSTtS; O^prsSh a manner that at the 3'^nd they code the cleavage site of a rertrtctton enzyme 

liTcS^l liitogS S*! gene lor the desired protein. It is also advantageous lor the ATG stan codon 

tT^ Z^^ Zi DCD triplet toS incorporated into the recognition eequence of a restriction enzyme so that 

ttlZ^^^^ constltuem accortmg to the Irwentlon can easily be Inserted in the ^^^^^^ 
the 9ene tor vie ^^^^^^ (enzymatc) cteavaga of the 

fusio^oriXoJmL«J le?e^Lpa8sescodons.fore^^ 

^S^T^^T^^^Z!^^y(^. In general, the upper limit tor the sum of y and z is 12. emce the baUast 
S^iiwrt^r^briSaspossible and. above all. not Interfere with the toWlngofIhe desired p«tein. 

^S^lS^ aSSiS. bacteria or^ eukaryotic cells such as yeasts are preferred as the h«* or^am n 
oenTfc ^g^r^g^J^ Prided that higher^«ms are not squired. In these P««~ses the^re««n 
rt^fheteraSJaTe te re^uteted by a homotogous regulatory regton. Le. one that is mtrhsic to the ^ojor corn- 
!Si tShS^^ll If a?re-peptSe Is expressed, it often occurs that the pre^uence is also heterotogous to 
tCS^^nnprSl^VlsSnT^^^^ 

sSSn^rtaliSC^^^^^^ -ent«n is adapted to «s envinximent. *. selection p«c«» according 
aa to the invention vields a DNAconatructkjnchafacterized by thia "sequence hamriony. ^ ^ . _ 

^r^SmlngaSd endof thebdlasl^^^ "^*'^'"^^„''^ f T rft^i 

an aS^ S or a ^p o1 amho ackls that permB the desired separation of the ballartocnstrtuent rem 
o^e^ to rtSe Jjl. tor example, the desired protein IS prolnsulin. M 

sSeded^ the last codon as this pemiits the particulaily favorable simultaneous cleavng off of fte baHaet 
SSShmer!^^Cchah.Scou«^^ 

r^jjaisr^iniaiiowsa*^^ 

''''^^L^l^^e amino acid sequence Should be ae Short aa possible so that folding 
affeJ2S^TJ.^h^n«houS'bereteth,eVhydr^^ 

JSSSs aS^ ttislon proteki remahs soluble. Cysteine residues are undesirable since they can Interfere with the 
'""^for^rnrtorm^Tart 

inaIl!^reSlS,pla8mldlminedla.eVh»«rtfoflhestmctum^ 
loZTl th^^iZntained in thte manner. Appropriate gene structure, can be obtained « th» way by the 
4S selection ol bacterial clonee that produce corresponding fusion proteins. ^ u«„i„„i,„ and and of the 

It WHS Diaviouslv mentioned that the cleavage sites for the restnctnn enzymes at the beginnrg and end of the 
nuclScZ SS^i^Sjrthe ballast consSuent a« to be regarded as examples on^. Recognition sequence. 
thatt^s^tSSSa and in which any nucl^^ 
aSs^^^ydex^ple.alsothoseforreslrictionenzymesAfllll.Ndel.Nlalll.^^^ 

«> rm^-mZ^.e.sto'beattheendoftheb^^seque^^ 

addHionat appropriate restriction enzymes can also be found here for use nstead of Mlul, i.e.. Nrul. Avrll. Afliii. ciai 

*^"SL«r Hisalsoadvantageoustousea-polymerase chain reactionVPCR)accordingtoS^^^ 

239:S?^ S^^^ensewwtri^ 
55 itwas previously indicated that limitation to the DMA sequence {DCD)k is lor reasons o^ expediency and that thi. 
doernTmKW^srcodonssuchas.forexample.thoseforg^ine.proline. ^sina. 

T^e Zt rtflclent embodiment of this DNA sequence is obtained by selection of good produce™ ol »te ftislan 
protean L^tel^SfSel^^ 
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ballast sequanca and desired protein, as a result of which unfavorable combinations of promoter, ballast sequence 
and structural gene are avoided and good results are oMahed with minimum expenditure In lem» of the above-men- 
tioned "sequence harmony'. 

Surprisingly, It was observed that the genes optimized tor the ballast constituent according to the invention do not 
always contain the triplets preferred bv E.coli. 11 was found thai for Thr, codon ACA. which is used least f requentV by 
E coll actually occurs frequently in the selected sequences. If. for example, the following amino acid sequence were 
^Sirtzed according to the preferred codon usage (p.c.u.) by Econ (p.c.u.: Aota. S. et al., Nucl8ic Acids Research 16 
(supplameni): r315. r316, r391. r402 (1988)). we would obtain a totally different gene stmcture than that obtained 
according to the invention (Cf .Table 1 ): 

Ala Thr Thr s«r Thr Ala Thr Thr 

CCC ACC ACC ACC ACC CCC ACC ACC p.c.u. 

GCJl ACa ACA TCA ACA CCA ACT ACO invantion 



In the case of the fusion proteins wrth a proinsulin constituent, the inHial starting point was a ballast constituent 
with 10 amho acids. The DNA sequence o1 the best producer then senwd as the base fOr variations In this sequence, 
whereupon It was noted that up to 3 amino acids can be eliminated without a noticeable loss In the relative expression 

20 rate This finding is not only surprising, since it was unexpected that such a short baltast protein would be adequate, 
but also very advantageous since of course the relative proportion of proinsulin in the fusion protein Increases as the 
ballast constiluent decreases. 

The sloilflcance of the ballast constituent in the protein Is apparent from the following comparison: 
Human proinsulin contars 86 amino acids. If. for a lusion protein according to EP-A 0 290 005. we take the lower limit 

tB of 250 amino acids for the ballast constituent, the fusion protein has 336 amino acids, only about one quarter of which 
occur in the desired protein. By comparison, a fusion piotein according to the Invention with only 7 amino acids In the 
ballast constituoni has 93 amino acids, the proinsulin constituent amounts to 92.5%. If the desired proten has many 
more amino acids than the proinsulin, the relationship between ballast and desired protein becomes even more fawo- 
r&bl6 

so It has been mentioned on a number of occasions that as a desired protein proinsulin represents only one preferred 
embcdimem of the invention. However, the Invention also works wtth much larger lusion proteins fdr which a fusion 
protein with the active domain of human 3^ydroxy^ethylglutaiyl-coenzyme A-reductase (HMG) » mentioned as 
an example This protein contains 481 amino acids. A gene coding for the latter is known e.g. from EP-A 292 803. 
Having now generally deecribed the inventkxi. the same will be more readily understood through reference to the 

3S foltowing exanples which are provWed by way ol illustratkxi. and are not intended to be limiting of the present mventfcn. 
unless specified. 

Example 1 

40 Constructton of the gene bank and selectton of a ckxie with higji expresskxi 

If not othenwlse Indteated, ail media are prepared according to Manlatis, T.; Fritsch, E. F. and Sambrook, J.: Mo- 
lecular Cloning. CoW Spring Harbor Laboratory (1 982). TP medium consists of M9CA medium but with a glucose and 
casamho acid content of 0.4% each. If not othenwse indicated, all media contain 50 \ig/m\ amplcillin. Bacterial growth 
4S during tennentatton Is determHed by measurement of the optical density of the cultures at 600 nm (OD). Percentage 
data refer to wei^t if no olher data is reported. 

The starthg material Is plasmid pH154/25* (figure 1 ). which Is known from EP-A 0 211 299 heroin Incorporated by 
reference. This plasmkl contains a fusion protein gene (D*-Proin) linked to a tipijromoter and a reaislanca gene for 
resistance against the antbiotic ampteiirr (Amp). The f uson protein gene codes a fusran protein that contans a frag- 
so ment of the ttpD-proteh from E. coll (D') and monkey proinsulin (Proin). The gene structure of the plasmW results In 
a polycistronk! mIRNA, which codes for both the fusion protein and the resistance gene product. To suppress the for- 
mation of excess resistance gene product. Initially the (commercial) trp-transcrlptkjn terminator sequence (trpTer) (2) 
Is introduced between the two structural genes. To do ao^ the plasmki Is opened with EcoRI and the protrnding ends 
are filled In wtth Klenow polymerase. The resulting DNA fragmem with blunt ends Is 

» (2) 
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5 ' AGCCCGCCTWVTGAGCGGGCTTTTTTTTS ' 
3"PCG6GCGGATTXCTC6CCCGAAAAAAAA5' (2) 

which resuKs in plasmid piNT12 (figure 1-(3)). „ 3 Hindill- 

cleavage site in Ihe carboxytemiml °' '^J^'P^'^f ^^^^ J^^^ f«xn which the one 

pl^fT12. By cutting the .^l^T ^^SnSAp JglJ aS STs st^ctured in an anal- 

MM T? . lElie 1«^ent (figure MS)) is Hgat*! with mixed ollgnonucleotKle (9) 

5 'CATGGCDDCDDCDDCD0CDDCDDCDA3 ' 

3'CGHHGHHGHHGHHGHHGHHGHTCCGC5' (9) 



30 



to 



TTCGGGTACCGHHGHHGHHGHHGHHGHHGHTGCGCAGS ' 
TTGCCCATGGC3 ' 

disHi:"r;ljS!.gbactert^clonesareteJe^ 
atliuem. For thi. purpose. ««rnigM J^^l^^g^^^ 

cultures are mixed with 10.5 mi -J^^^";-" ^^.^^^^^ adcW and the preparation Is shaken for 

3-p-^ryllcac«(lM).a^^^^ 

and 4% m 2.mercaptoethanol). The suspension te ►'^^^ "^J'^sD^^ bacteria that 

,0 reduce viscosity and """^JIl^V^^^^^"^^^ evident that one of the 
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Table 1 

2 3 4 5 6 7 8 9 10 11 pINT 

Met Ala Thr Thr Ser Thr Ala Thr Thr Arg 

ATG GCA ACA ACA TCA ACA GCA ACT ACG CGT 41 

Thr ser Thr 

*** *** *** **6 A*T T*G A*G **G *** *** 42 

Ala Thr Ser Thr Ser 
**T G** *** A*T T*T A*T T*A *** *** 43 



Asn Ser 

*** *** *** *** *** AAC T*A *** *** «0 



*** *** *** *** *** *** *** *** *** 

**« *** *** **• *** **• *** *** 



* **A 69d,72d 



*** *** *** *** *** ** 



Gly Asn Ser Ala 
**« *** *** *** *** *** *G* *A* T** GCA **A 90d,91d 



Lys — — — 
*** *** *** *** *** *** AA* **A S3d 



Pro 

*** " 



*** i,** *** *** *** C** **A 94d 



Met 

*** *** *** *** *** *** ATG **A 
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Gly 

*** *** *** *** *** *G* **^ 



Example 2 

Selection a! additional clones 
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31-35. 19S3) m used. TP-agar ^^J^'^^^ ^ ^ coated with a 2-nim thick TP top agar layer, a 



Exannple3 

Preparation at plasmid plNT41d 



sequence of which ie reproduced in Table 2. 
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Table 2: DNA-Sequence of Plasnid pINT41d 

50 



10 30 



GTGTCATGGTCOSTGATCGCCAGGGTGCCGACGCGCATCTCGACTTGCACGGTGCACCAA 

90 

VATCGGAAGCTGTGGTATGGCTGTGCAC 

150 170 



90 

TGCTTCTGGCGTCAGGO^GCCATCGGAAGCTGTGGTATGGCTGTGCAGGTC^^ 



CATAATOCGTGTCGCrCAAGGCGCACTCCCGTTCTGGATAATGTT^ 
310 330 350 



TGCATAATTCGTGTt;ijtrr^AAwv.ww*%^ * — 

210 

TCTGGCAAATATTCTGi 

TAACTAG?i?GCAAGTTCACGTAAAAAJJCT^^^ 



tcataacStoctggouuitattctga^tc 



ctacgcgSpcgtgaaccagcacctgtgcggctcccacctagtggaagctct^^ 

*tcin 4iu 



370 390 



tgtgcggggagcgaggcttcttctacacacccaagacccgccgggaggcagaggacc^^ 
aggtggggcaggtggagctgggcggggSc?ctggcgcaggcagcctgcagc^^ 

510 

cagaagcgcggcatcgtggagcagtgctgcacca 

570 

AACTACTGCAACTAATAGTCGACCTTTGCTTTCfl 

630 

AGAATTAGCCCGCCTAATGAGCGGGCTTTTTTri 
690 

'gatacgcctatttttataggttaatgtcatgat; 

750 

.okrwrreCGGGAAATGTGCGCGGAACCCCTAT] 
810 



490 

IGTCC 
550 
^GCTC 
610 

:aaa( 

CGAAAGGGCCTCGTGATACGCCTATTT^SiTAGGTTAATGTCATGATAAT^^ 



TGGAGGGGTCCCTGCAGAAGCGCGGCATCGTGGAGCAGTGCTGCACCAG^^^ 

tctaccagctggagaactactgcaactSWgacctttgcti^c^^ 
agctgtcaaacatgagaattagcccgcctaatgagcgggcttt^ 

690 



tagacgtSggtcgcacttttcggggaa^tctgcgcggaacccctatttg 

_ 810 oJU 

taaatacattcaaatatgtatccgctcatgagacaataaccctoataaatg 

c/* 870 

i^GGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATOCCCT^^ 

930 ^^^^ 
TTGCCTTCCTGTTTTTGCTCACCCAGAAACGCrGGTGAAAGTAAAAGATGCT 

•70 990 1010 

GAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCG^^ 

1 niA 1050 lOVO 

COTGAGA6TTTTCGCCCCGAAGAACX;TTTTCX»AT6ATGA^ 

inQf\ 1110 HwU 

TGTGGCGCGGTAmTCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCC^^ 

1150 1170 1"° 

?caga; 

ATGACAGTA^GAGAATTATGCAGTGC^aC^ATAACOlTGAGTGATAACACT^^ 

TTACTTCTGACAACGATCGCAGGACCGAAGGAGCTAACCGCTTTTTTGCA^ 

1330 1350 13/0 

GATCATGTAACTCGCCTTGATCGTTGGGAACCG6AGCTGAATGAAGCCATACCAAACGAC 

GAGCGTGACACCACGATGCCTGCAGCAATCGCAACAACGTTGCGCAAACT^ 



850 

tattgaaaaaggaag; 

910 



tattctcagaatgacttggttgagtactcaccagtcacagaaaago^totacggatggc 

1230 IZDO 
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lASft 1470 
GAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATC^^^ 

GCAGGAC^i^TTCTGCGCTCGGCCCrilcSGCTGGCTGGTTTATTGCTGAT^^ 



gccggtcagcgtgggtctcgcggtatcaS^^ 

;gagtc 

1710 

atcgctgagataggtgcctcactgattaagcattggtaactgtcagaccj^gtttactca 



1690 
?GAGA 
1750 



cgtatcotaSttatctacacgacggggagtcaggcaac^^ 

rraagcattggtaactgtcagaccaagi 

1770 ^''90 

tatataSJagattgatttaaaacttcatttttaatttaaaaggatctag^^ 



X830 

CTTTTTGAiivTCTCATGACCAAAATCCCOT^ 

GACCCCGJISAAAAGATCAAAGGATciJ??TGAGATCCTTT™ 

1950 ±Tftu 

:aaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagcta 



1930 1950 
TGCTTGCAAACmlt*™**^"'-*-"— 2010 2030 



CCAACrCOTTTCCGAAGGTAACrGG^^GCAGAGCGCAGATACCAAATACTC^ 

cta6TGtJI?cgtagttaggccaccacotcaag 

GCTCTGOTi^TCCTGTTACaGTGGCTGCTC^ 

TTGGACTS^ACGATAGTTACCGGTiiwCGCAGCGGTCGGGCTGAACGGG^ 

GCACAciacScAGCTTGGAGCGAACGACCTACAWGAACTGAGATACCTA^^^ 
2290 2310 



ATTGACAAAGCGCOVCGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCA 

235C 
6G6TCGGAAC 



2350 2370 

AGGAGAGCGCACGAGG6AGCTTCCAGGGGGAAACGCCTGGTA 

'i'4"lO 2430 2450 



GTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATrPTTGTC^ 

2470 2490 2510 



GGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCT 



?TTTTACGGTTCCTGGCCTTTTGCT 



•juia 2550 2570 

GGCCTTTTGCTCACATGTGTCAGAGGTTTTCACCGTCATCAC0GAAACGCGCGA6GCAGC 

TG 



ExamoteA 

Fermsnialion and processing of plNT4l*tu9lon protein 
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the mixture is asain stirred fcjr thirty minutes at SPC. 

H^«tnor«ohv The entire batch of (iii) ie diiuted with 250 ml buffer A (50 mM glycine. 
{w)DEAE-Aniori exchange chromatography. -me enweow^ 

pH 9.0) and placed or, a chromatography ^^J^ ^ P«'«'"* 

ml! column diameter 26 B (50 mM glycine ol pH 9.0. 

suilonaie is eiuted with a '^I^^'^^J^SI^bSS^^^^ <u«ion p^teh-S^uHonate are then 
3MureaandlMNaCI)ataftawratBO»9mWninuta.THerracnon8«inuiin y 

combined. 

(v)F.dingandenzvmat™Tha«^^^^^ 

rr;ret;Kar-^^^ 

S continued lor another 4 hours a, ^'^^^S^XCeTSJ^i?^^ now has pH 8 is d^ested 
acid(HEPES)i8addedloanendconce,^trat«nomgp^^^ 

wtthtryp8ln«2S-C.Duringthepr«e8S.thee^.nea.nc^^^ 

insulin with cartwxypeplidase B. 
Examples 

Construction of plasmld pINTfiO 

The ollgonucleoiWe InsuB 

TTCGGGTACCGTTGTTGTAGTTTGAGTTGCGCAG 5' 
TTGCCCATGGC 3' 

gxamolee 

Construction of plaBmId plNT67d 

P^sm« p.NT67d is a derivative C piNT4ld ^ wj«=h the codon ^l^J f^lj^:^^^^^^^^ 
sequence is Jeteted. That . why. iIKe ^^^f^^^^^!^^ "SS^rT^^fo, thfe purpose, two 
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TIR: 5'-CTG AAA TGA GCT GTT GAC-3' 

and 

DTR8: 5'-CAC AAA TCG AGT TGC TGT TGA TGT TGT-3' 

or 

DTR9: 5'-ACA GCA ACT CGA TTT 6TG AAC CAG CAC-3' 

and 

insull; 5'-TCA TGT TTG ACA GCT TAT CAT-3'. 

Thia nrcducas two f raqmento thai are partially complemerrtaiy to each other and when annealed with wch other code 

riiS^^n^^ a8rNT41d in Which, however, the amino acid In poaltlon nine la absent. For compWIon. 

a Pf'^^l'^Vf " hi-ctad to another PCR together with the oligonucleolides TIR and Insull. From 

! :SS^S^^ ^tfn^S^^ 'nsulln precu^ Is liberated w«h Nco and Sail 

'rplJi^S'p^SlJrCiSenalsocutwnhthesetwoen^^^ 

Hoaled with the stmctural gene Iragment from the PCR to yield pla8midplNT67d. 

^ Cm«S^de and amino aciJeequences lor the ballast region h«/e been reproduced mi&ble 1 . 

Example? 

Construction of plasmid plNT68d 

Like plasmid plNT67d. plasmid plNT68d is a shortened derivative of plasmid plNT4ld in which the «»donBof Uie 

twoinTnSnpMltlonselgM^^^^ 

d only eight amino ac«s. The p»cedure pr.viou.ly d.«:ribed r Example 6 » used for rt, 

• construction but with the two ollgnonucleotide pairs 

TIR: 5'-CTG AAA TGA GCT GTT GAC-3' 

and 

DTRIO: 5'-CAC AAA TCG TGC TGT TGA TGT TGT TGC-3' 
or 

DTRll: 5'-TCA ACA GCA CGA TTT GTG AAC CAG CAC-3' 

and 

Insull: 5'-TCA TGT TTG ACA GCT TAT CAT-3' . 
The nucleotide and amino acid sequences tor the ballast region have been reproduced Inl&ble 1 . 

Construction of plasmid plNT69d 

Plasmid plNT69d Is also a shortened derivative of plasmid plNT4ld in which, however, lha oodon. of the three 

an^no ikS In pos«tona seven, eight and nine a. the batest ^^"^^SJrS'^^^iL^^ 

insulin precursor with a ballast sequence of only seven ammo acids. Ttie procedure descrbad n Example 6 » also 

used ttor tts constnjctlon but with the two oligonucleotide pairs 
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Tho 



TIR: 5'-CTG AAA TGA GCT GTT GAC-3' 

and 

DTR12: 5'-CAC AAA TCG TGT TGA TGT TGT TGC CAT-3' 
or 

DTR13: 5'-ACA TCA ACA CGA TTT GT6 AAC CAG CAC-3' 
and 

insull: 5' -TCA TGT TTG ACA GCT TAT CAT-3' . 
nucleotide and amino ackJ sequences lor me ballast region have been reproduced Iniabte 1. 



ExarriDle9 

Construction til plasmid plNT72d 

Plasmid plNT72d i* a derivative ol plasmid pi KfTBSd in which the entiiB C^iaptide gene region, «*[^f t*"" 
0. theHST^lie ammo acid arginine. is deleted. Consequent^r. thte results ^, a :;;;J:]'P'»'™^^^ 
Sia^Jn^e^ueinsteadolaC^haiaWrth plasmid plNT69dasastart^ 
8 Is also used for Its constnjctlon but with the two oligonucleotide pairs 

TIR: 5'-CT6 AAA TGA GCT GTT GAC-3' 

and 

Insu28: S'-GAT GCC GCG GOT CTT GGG TGT-3 ' 
or 

Insu27: S'-AAG ACC CGC 6GC ATC GTG GAG- 3' 

and 

Insull: 5' -TCA TGT TTG ACA GCT TAT CAT-3'. 

ExamolelO 

Construction of plaamids (ANUdd. plNT88d and plNT89d 

Plasmid plNT79d Is aderlvatlwe ol plasmid pINTBSd (Example 8). In wtilchthe Insulin precursor gene tearranged 
lv««mrinsuccessico. The plasmid the^ 
i5d.^orrconetruc»ion.aPCRn«lion is earned ou^ 

InBU29: 5'-CTA 6TA CTC GAG TTC AC-3' 



and 

Insull: 5' -TCA TGT TTG ACA GCT TAT CAT-3'. 



Thie gives rise to a fragnent with the insult) precursor gene and the pertinent 
5'^nd region has a cleavage site lor enzyme Xhel and h its 3'-end region a cleavage ste ^T^^"- T;*J^!SZitSl 
with 1hei««above.martioned enzymes and purified. Plasmid plNT69d is than line^^ 
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p^iuced are dephosphonr^led ^ phosphatase (f^n ca» intsst.e) and .iga.ed wtth the Ira.men, .on, the PCR 
reaction toyleW plasmid pirf173d p,^^^ p,NT72d (Example 9) 



gitamole 11 

Construction of plasmid plNL41d 



tac-promoter fragment to yield plasmid plNL4ld. 
gxamDle12 

Construction of plasmid pL4lc 

P^s.« PPL^MB (wh«, can be o«a.ed 1«m Pham«c.) has a .an*da^-P«xnC»r region. The «.er . 



flanked by nucleotide sequences: 



5 ' GATCTCTCACCTACCAAACAAT3 ' 
and 

5 ' AGerAACTGACAGGAGAATCC3 ' • 



} 



Oligonucleotides 



LPL3: 5' ATGAATTCGATCTCTCACCTACCAAACAAT 
and 

LPL4: 



TTGCCATGGGGATTCTCCTGTCAGTTAGCT 3^ 



Construction of plasmid pL4ld 



• » i~,»i-rfh«h«flntli« resistance flsne and the fusion protein gene In plasmid pL4lc 
The trp-transcription tem,.nator ^^^^^T^^^^^^^ N4830-1 ). For this reason, a polyctetronlc 
18 not effective in strains tfiat are suitable for lermerrtj^^ To prevent this side reaction. 

mRNAand with ita brgequantay on^felance ge^^^^ 

thettp-tem^lnalor sequence IS replaced byane^^^^^ 
MAhasastnictureslmllartothatrtplasmldpllWld^rth^^^^ 

andanrmB-temi^torsequenceff^m^^ 

SimrdSrr^-r-v^^^^ .*sf«gm«,tco^.ingtheinsu.ingsne« 
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Isdated. The t«o iaolated fragments are then ligated to yield plasmid pL41 d. 
Ficample 14 

Conetrudlon of plasmid pINTU 

ligated. toyield plasmid pINTU. with thelollowing twosynthBlB oligonucleotides 

BBtBII Acci ScoW Kpnl BMiHI 

5' C6C6CCTOCTTACCTMAWTATACTACCAATTOCAOCTOWTRCCCGWATCC 
3. 0GACaAt06»eCTCCATATCATGCITA»CCIC0A<KM»TCWWCCCTAG0 

SphI «>al 
CTGCAOGCATCCRACCTTGICTAGAC -3' 
GACGTCCCTACCTTCGAACACATCTGAGCT-5 ' 
PBtl Hindlll (Sail). 

EmiTiDle IS 
to DNA sequence VI 



llco ecoRt Hindin BwHI Kbsl 

Vl-1a 



5' 
3, 



IS 
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a PrBpiiiati«i' ^"i'«h'««"hieh contain DNAffMuanca VI 

nwA ««u«nca VI maw be orepared by standard techniques. The conimerclally available plasmid pUCIfl (or 
urTMSTor MI^DlS) S^iiththe restriction enzymes EcoRI/Hlndl II as stated by the rrjanutecujrer. 

h P»«ai«tlfln d hvhrid ptaemi f fa ■^'"h °«rtain the nene fraqmente HMG I to HMG IV. 

i) Plasmid containing the gene tragment HMG I 

TheplasmidpSU18i8CUtopen*i1hth6««trlctionenzyTne.EcoRlandNcolna^^ 
1 5 (J ^ ^ llgated Z the gene tmgment I «*.lch has prevtousV been phosphorylated. 

ii) Plasmid containing the gene tragment HMG II 

III) Plasmid containing the gene fragment HMG III 

been opened with HIndlll/BamHI. 

IV) PlaamkJ containing the gene tragment HMQ IV 

gene fragment IV is obtained. 

, ^^et„.Mi»n nf iha hybrid o to »mM pUHIO which oontains thg PNA WWanoa V 

Thehybridpbsrndwith.hegenefraBmentHMQIi.open*! with EooB^^ 
II which 18 obtaTned by restriction enzyme digestion of the corresponding hybrid plasmid with EcoHiyHndH . T^ere 

rd=;rsritr^^^^^ 

""trSSSrcS^BsEI. and Xba. and the large Ira^ent . ^ted. and - ^^jj-''^^ 
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10 



argininefollo^sme first eigmamhoacids in the balla8l8equer«iolplNT41d(Tablal^ 

»i meTmctural gene of the acttve domain ol HMG CoA-reduelase. For purposes of comparison the two Inltta^ 
nf^SlaSXeS^«Ik»andXbalandthe 

S 2^. im.^ieir^er the start codon. the active domain of HMG CoA^uctase (h accordance w«h DNA 

• ^^^^JZ^^ emiKAM Dfotein of approximately 55 KDa Is detemtlned h the supernatant by gel elec- 
ir^rf rZTorm e^ton^r^^ intensive here tt«n tor the protein a^-seddire<rtV Ind^^ 

TxprlL ^c::^ g^aS^^^^ governed by the concenUatton. Th. ind^ better expres8.on of the .us.on 
protein by a factor of at least 100. 
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FiramplftlS 

Construction of plasmid pB70 

Plaemkl pllsfT41d is split wrth Mlul and Sail and the large fragment is isolated. Plasmid plK4 shown in figure 3a 
rvmtAinfi a aena for 'minl-oroinsulin.' the C chain of which consists of arglnlne only. ^ ^ . 

r^t^li 01 Si plasmw has prevtously been deserved In EP-A 0.347.781 (herein 'nco^at^i by refe^ 
encersSSv. the commereiki plasmid pUC19 is opened using the restriction enzymes '<P"' ^. 
ire^Sent ffloure 3-(1)) is separated through a 0.8% strength 'Seaplaque" gel. Thie Iragmenl la reacted with T4 DNA 
liS^liig Jle mUm^H^)) eynthLed accoolhg to Table 4. Table 4 sh«»s the sequence ol gene Iragment 

on |J?gS"^« wh«h cont^n 20 mg/1 of ampteillia The pbsm«J DNA « 

characterize by restriction and DNA sequence analysis. The desired plasmlds are called pIKI (figure 3). 

SS?gv! ihTDNA (figure 3K5))^ccording to Table 5 is l^ed hto pUCI 9 which ha. been opened us^g Petl 

and Hindlll (Rgure 3-(4)). TTie plasmid plK2 (figure 3) Is obtained. . , *,„ nb«niri« niKi and 

The DNA sequences (2) and (5) of figure 3 according to Table 4 and 5 are reisolated from the Pla^mids pll<1 and 
pll<2^°i^rX3ci9. «*ilih has been opened using Kpnl and Hindlll (figure 3^7)). The ptosmid plK3 (figure 
la thus Obtained Which erwodes tor a modified human msulh sequence. ^ • ,• »^ 

* TepSllSte opened using Mlul and Spel and the large fragment (figure 3a-(9)) is «olated. Th» « Lgated 
With the DNA sequence (10) 

B30 Al A2 A3 A4 A5 A6 A7 X8 A9 

(Thr)(Arq) Cly !!• Val Clu GXn Cye Cym (Thr) (Ser) (10) 

5' OG CGT GGT ATC GTT CAA CAA TGT TGT A ^ 

3' A CCA TAG CAA CTT GTT ACA ACA TGA TC 5' 

(Klul) <^P*^> 

Which supplements the last codon Of the B Chain (B30) one argN^^ 

flTst 7 amino acids of the A chain and eupplements the codor^ tor the amino acids 8 and 9 of this chain. The plasmid 
nl iCA, ffiaure 3a1 is thus obtained which encodes for human mini-prohsulin. 
' iS^Ta^ 5. thee- and A-chalns Of me insulin molecule are in each 

acid. n1^ to the coding region in gene fra^ent IK II. there is a cleavage sit. for San which will be utilized n the 

'^"^'SiSr^iJ^ with Hpal and Sail and the gene oodng ■mW^.ro^isulin' « isolated. This gene ie ligated wHh 
the above-mentioned large f raffnent of plNT41d and the following synthetic DNA sequence. 













Met 


Gly 


Arg 


Phe 




ATG 


GGC 


CGT 


TTC 


GTT 


TAC 


CCG 


GCA 


AAG 


CAA 
(Hpal) 



(Mlul) 

This gives rise to plasmid pB70. which codes a fustan proteh In which the baBast sequence f^^;;^^*^^'^ 
by amino acid sequence Mel-Gly-Arg which is followed by the amino acid sequence of the -hinnOTinsulin . 
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£0 



30 



TABLE 4; Gene fragment IK I (2) 



Phe 

10 20 30 40 



10 <- 



CT TTG GAC AAG ACA TTC GTT AAC CAA CAC TTG TGT GGT TCT CAC 
CAT GGA AAC CTG TTC TCT AAC CAA TTG GTT CTC AAC ACA CCA AGA CTG 

< 1 > 

IS (Kpnl) "P*^ 



50 



60 70 80 90 



— > <— — 

TTG GTG GAA GCC TTG TAG TTG GTT TGT GGT GAG CGT GGT TTC TTC 
AAC CAC CTT CGC AAC ATG AAC CAA ACA CCA CTC CCA CCA AAG AAC 

Thr Arg Lya Gly Ser Leu 
100 110 120 



TAC ACT CCA AAG ACG CGT AAG CGT TCT CTG CA 
ATG TGA GGT TTC TGC GCA TTC CCA AGA G 
— — ... -> 



40 



45 



SO 



ss 
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Tabl« 5: Gene fragment IK II (5) 

Gin Lya Arg Gly 

130 140 150 160 

6 

^ G AAG CGT GGT ATC GTT GAA CAA TGT TGT ACT A6T ATC TGT TCT 
AC GTC TTC GCA CCA TAG CAA CTT CTT ACA ACA ^ShJSSh TAG ACA ACA 

(P^I)" 

A«^ 
Asn 



10 



20 



2S 



170 180 150 200 210 



TTC TAG CAC CTG GAA AAC TAC TCT AAC TGA TAG TCG ACC CAT GGA 

AAC ATC GTC GAC CTT TTG ATG ACA TTG ACT ACT ACC TGG GTA CCT TCG A 

> 

^^^^^^^^^ 



90 Examoie17 

By using the oligonucleotides listed below there are obtained plaemids pINTQOd to plNT96d ir, anaJo^^^ the 
prevtoua ex^ptes. An asterisk indicates the same encoded amino acid In the bailasi ^^^^ as In pi 
^ PINT92 encodes a double mutation in the insulin derivat^e sncoded by the P^^^^l^^^ 
3S ArgStheendoltheballastconstlluemandlnthe-minlCchah-lssubstrt^^ 
preproduct can be cleaved with cyanogen bromide. 
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pINT90d: ******GNSA* (variant of pINT69d) 
TIR: 5 ' -CTGAAATGAGCTGTTGAC-3 
and 

InsuSO : 5 ' -TGCCGAATTTCCTGTT6ATGTTGTTGC- 
or 

Insu4 9 : 5 ' --GGAAATTCGGCACGATTTGTGAACCAG- 
and 

InSUll : 5 ' -TCATGTTTGACAGCTTATCAT-3 ' 



pINT9ld: ******GNSA* (variant of pINT72d) 
TIR; 5 ' -CTGAAATGAGCTGTTGAC-3 ' 
and 

InsuSO ; 5 ' -TGCCGAATTTCCTGTTGATGTTGTTGC 
or 

Insu4 9 : 5 ' -GGAAATTCGGCACGATTTGTGAACCAG 
and 

Insull : 5 ' -TCATGTTTGACAGCTTATCAT-3 ' 



EP04B97aOB1 

pINT92d: (double mutant of pIKT72d) 

InsuS 6 : 5 ' -TCGACCATGGCAACAACATCAACAATGTrTGTG-3 
and 

Insu58 : 5 ' -GATGCCCATGGTCTT-3 ' 
or 

Insu57 : 5 ' -AAGACCATGGGCATC-3 ' 
and 

Insul 1 : 5 ' -TCATGTTTGACAGCTTATCAT-3 ' 



pINT93d: ****** (variant of pINT68d) 

InsuS 3 : 5 ' -ACCATGGCAACAACATCAACAAAACGATTTGTG-3 ' 

and 

Insul 1 : 5 ' -TCATGTTTGACAGCTTATCAT-3 ' 



pINT94d: ****** (variant of pINT68d) 

Insu54 : 5 ' -ACCATGGCAACAACATCAACACCACGATTTGTG-3 ' 

and 

Insul 1 : 5 ' -TCATGTTTGACAGCTTATCAT-3 ' 



pINT95d: ****** (variant of plNT68d) 

Insu55 : 5 ' -TCGACC71TGGCAACAACATCAACAATGCGATTTGTG-3 ' 

and 

Insull : 5 ' -TCATGTTTGACAG<rrTATCAT-3 ^ 



pINT96d: ****** (variant of pINT68d) 

Insu7 1 : 5 ^ -ACCATGGCAACAACATCAACAGGACGATTTGTG-3 ' 

and 

Insull : 5 ' -TCATGTTTGACAGCTTATCAT-3 ' 



Claims 



1. A piocaBs lor the preparation pf fusion proleins. which fusion proteins contain a desired protein and a ballast 
constltuerrt, which process conprlses 
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(a) constructing a mixed oligonucleotide which codes lor the said Isallast constituent, wherein the said oligo- 
nucleotldo contelns tho DNA sequence (coding strand) 

5 (DCD). 
in which D is A, G orTandx is 4to 12, 

(b) inserting the said mixed oligonucleotide into a vector so that It Is lunctlonally linked to a regulatory region 
10 and to the structural gene coding tor the said desired protein, 

(c) transforming host cells with the so^tained vector population and . u • 

(d) selecting from the transformants one or more clones expressing a fusion protein in high yield. 

2 The process as claimed in claim 1 . wherein the said ollgonucleodle codes al its 3r end of the coding strand for an 
IS ' amir.0 acid or for a group of amino acids which allows an easy cleavage of the said desired protein Irom the sad 
ballast constituent 

3. The process as claimed in claim 2. wherein said cleavage Is an enzymatic cleavage. 

20 4, The process as claimed in claim i, wherein the said oligonucleotide is designed so that it leads to a fusi^ 
which Is soluble or which easily can be solublllzed. 

5. The process as claimed in claim 1 . wherein the said oligonucleotide is designed so that the ballast constituent 
does not interfere with folding of the said desired protein. 

25 

6. The process as claimed In ctelm 1 . wherein x Is 4 to 8, 

7. The process as clalned In claim 5. wherein the said oligonucleotide has the sequence (coding strand) 

30 

ATG (DCD)y(NNN)2 

36 wherein N h the NNN triplet stands lor identical or different nucleotides, excluding slop codons for NNN, z Is 1 to 
4 and y + z is 61o 12, y being at least 4. 

8. The process as claimed In claim 7. wherein y + z Is 6 to 10. 

40 9. The process as claimed in claim 7. wherein y is 5 to 8 and z is 1. 

ia The process as clahied In claim 1 , wherein the said oligonucleotide has the sequence (coding strand) 

45 ATG GCW (DCD)^^ CGW 

in which Wis A or T. 



so 



66 



Pfltontaneprflcho 

1. Verfahren zur Herstellung von Fusionsproteinen, weiche Fusionsprotaine ein gewOnschtes Protein und einsn Bai- 
lastbestandtsli enthatten, welches Verfahren umfadt 

(a) das Konslmisran einss gamischten Oligonucleotids. welches f Qr den genannten Baliastbestandleil codlert, 
wobei das genannte Ollgonucleolid die DNA-Sequenz (codlerender Strang) 
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(DCD)x 

enthait, worin D fflr A, G Oder T sieht und x von 4 bla 12 bertrflgi; 

(b) das Insertieren des genannten gamischlen Oligonucleotids in einen Nfektor. so dafl dieses an eine ragu- 
latof ische Region und an das Strukturgen. welches IQr das gewunschle Protein codlert. lunkllonell gebunden 

(c) das Transformieren der Wirtszellen mit dar so arhaltenen Vsktorpopulation; und 

(d) das Selekiieren von einem oder mehreren Klonen. welche ein Fusionsprotein in hoher Ausbeute exprlmle- 
ren, aus den Transformantsn. 

2 Varfahren nach Anspruch 1 , worin das genannte Oligonucleotid an seinam 3'^nde des codierenden Stranges fQr 
eine Aminosaure oder sine Gmppe von Aminosauren codiart. wodurch eine leichta Spaltung des gewQnschten 
Proteins von dem genannten Ballastbestandtell emndgilcht wird. 

3. Vertahren nach Anspruch 2, worin die genannte Spaltung eine enzymatlsche Spaltung 1st. 

4. Vertahren nach Anspnjch 1 . worin das genannte Ollgonucleodd so ausgestaltet 1st. dafl es zu einem F uslonsprotem 
f Ohrt. welches Wslich isl oder welches lelcht solublllslert weiden Kann. 

5. Vertahren nach Anspruch 1 , worm das genannte Oligonucleodd so ausgestaltet ist. daB der Ballastbestandtell die 
Faltung das genannten gawflnschten Protaina nicht beertrachligl. 

6. Vertahren nach Anspruch 1, worin X von 4 bis a batragt 

7. Vertahren nach Anspruch 5. worin das genannte Oilgonucleolld die Sequenz (codlerender Strang) 

ATG(DCD)y(NNN) ^ 

besltzt. worin N Im NNN-Triplelt lOr Identische Oder verschledene NuWeotlde steht. wobel Slopcodons lOr NNN 
ausgeechlossan sind, z von 1 bis 4 betragt und y+z von 6 bis 12 betragt, wobei y mindestens 4 lat. 

8. Vertahren nach Anspruch 7, worin y+z von 6 bis 10 betragt. 

9. Vertahren nach Anspruch 7. worin y von 5 bis 8 betragt und z 1 ist 

10. Vertahren nach Anspruch 1. worin das genannte aigonucleotid die Sequenz (codlerender Strang) 

ATG GCW (DCD)4^9 CGW 

besltzt. worin W A oder T ist. 
Revendlcations 

1 . Procddd pour la prdparation de protdines de fusion, lesquelles protdlnas de lusion contiennant une proline re- 
cherchde et un constituant de lestage, (equal procddd comprend 

(a) la conatmctlcn dun oligonucidotide mixte codant pour ledit constituant da lestage, ledit ollgonucldollde 
contenani la sequence d*ADN (brin codant) 

(DCD)^ 

dans laquelle DestA.QouTetxvade4^12, 

(b) rnaartion dudit ollgonucldolida mate dans un vectaur, da manifere qu'il soil fonctionnallemant Ii6 k una 
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rdgion rdgulatrica et au g6ne structural codant pour ladite protdine recherchde. 

(c) la trarwfoffnatlon do cellules hOtee par la population de vecteurs airwl obtenue ©t 

(d) la sdlsctbn. k partir des transformants. d'un ou plusisurs dones exprimant una protdine de lusion avec un 
rendement 6l6v6. 

Proc6d6 selon la revendlcatlon 1 , dans lequel ledrt oIlQonucl^otide code h son extf6nrill6 3* du brin codant pour un 
aminoaclde ou pour un groupe d'anninoacldee pemiettant une separation alsde de ladite protdine recherchee 
cf avec ledit constituant de lestage. 

Procddd selon la ravendication 2, dans lequel ladite separation est une coupure enzymatique. 

Procddd selon la ravendication 1 , dans Isquel ledit oligonucldotlde est conpu de manl&ra k conduira k una protdine 
de lusion qui set soluble ou qui peut dtre aisdnnent solubilisda. 

Procddd selon la ravendication 1, dans lequel ledit oligonucleotide est congu de maniftre qua la constituant da 
lestage n'lnterf6re pas avec le repllemem de ladite protdine recherchde. 

Procddd selon la ravendication 1 . dans lequel x va da 4 d 8. 

Procddd selon la ravendication 5, dans lequel ledit digonucldolide comporte la sdquence (brin codant) 

ATG (DCD)y (NNN)jj 

dans laquelle N dans le triplet NNN reprdsante des nucldotldes identiques ou dilfdrants. k I'axclusion des oodons 
tfarrdtpoufNNN.zvadel a4ety + zvade6ai2.ydtantau moins dgal&4. 

Procddd selon la ravendication 7, dans lequel y + z va de 6 6 10. 

Piocddd salon la ravendication 7. dans lequel yvadeSftBetzeatdgaldl. 

Procddd salon la ravendication 1 , dans lequel ledit oligonucldollde comporte la sdquence (brin codant) 

ATG GCW (DCD)^_g CGW 

dans laquelle W est A ou T. 
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